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The Ml 76.2 gene is located on chromosome n. Regulatory sequences can be found e g in the 
region between 9416340 and 9415915. An exemplary sequence of tiiis region is as follows: 

GACAATTATCGATTAATAAAAGTTTiW^CAGACT^CGAGAA^^TTAAATATA^^ 
GTTGTTTTGTGTGTAGAAAAATAATTTTGATAGAAACi^^AAAATTAG 

TACTCGTGCACGAGATGTGCGCCAGCAGCTCCTTGACGCAAAACGTGACGTTTAGCACCAA AATGATTTT 

-378. 

TGCTCTTTGAGTTCTTTGTTTTCGGGAGCAAATTTCATGCCAATCCCTTTCTTTTO^ ' ' 

TTAAATTCATGTj^VTAACTATTATTCATGTCAATTACAAgAARTJAGCATCCAAGAT TTTATCATA AACT 

"^^243 • ■ 

■ . 

CGTTCAAACCTCCTTTTACC^CTCGAAAAGCa^ATATCTCCGACTTCCTTCaAAGAGAAAT GAT 

. . • . -169. 

CATAGAAACCTCACGTTATACGTTTTGTCMCACGATTTCAGTGCTC^^ 

TAATTTaVTTTTTGTaVCTCTCGCGTC:7VTGTTTTGCATTTTTCGAAAGas.TTTATTTAAAACTGAAAAAA 
TAATTCGTAATTTTTCAAGAATCGCT 
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Exemplary Ml 76,1 sequences: 




MAQKDDRILL 
ASLYHFIAYE 
MGAHALRLTE 
LGVPMTCIIR 
KTQQVLSEDG 
LPEEERGAFI 
VDTPFLYEFI 



LNAPRLPLED 
FDFLIDIHKN 
WHIRVLKALN 
RNLTQCYEQL 
VLERFIGKPR 
I^KLKPMXI 



DKLNELTADIi 
WKTDDFTRN 
ISDDVIQRAI 
SLNDRSDLMI 
EARDIRA3FA 
ENYLVLAKKP 



HDWAHANGLV 
MVEILKKVKA 
PENKPIPMIA 
DGRQVAIVYF 
GMWALENTDE 
1TFA£CAVSEL 



MRLSTDKLSS 
QGLKQPVTIiA 
EALFECAWSHF 
RAGYSPDHYP 
VTMKWAGAQ 
GVYGYAFGRK 



EVCQTTPLTL 
IQRSDYMCHK 
SNPAAWLW 
STKEWEARER 
KHPEAFVLKP 
DAPELKTAGH 



LPSPFPKNVF 
DQYSAEYGLK 
VENVNQNQID 
MELSTAliCTP 
QTEGGAALHT 
LLRTKPESTA 



EEAVHIQNLF 
QIEINNIASS 
QRHVEYELEK 
WIGLQVANTK 
GDEMVQMLRE 
MGGVAAGHAV 




aaagaATGGCT CAAAAAGAT6 ACCGGATTTT GCTGTTGAAT 
TCACCGCTGA TCTTCACGAT TGGGCTCATG CTAATGGGCT 
TGTCAAACTA CTCCATTAAC ACTTCTTCCA TCTCCATTCC 
TTTCGCAAGT CTTTATCACT TCATAGCTTA TGAATTTGAT 
ATTTCACACG GAATATGGTT GAGATCTTGA AGAAAGTCAA 
CGATCTGATT ATATGTGTCA TAAGGATCAA TATTCAGCGG 
GTCAATGGGA GCACATGCTC. TACGGCTCAC CGAATGGCAT 
TTCAT^GAGC 7\ATTCCAGAA AACAAGCCAA TTCCAATGAT 
CCAGCAGCTG TGGTTCTTGT CGTTGTAGAA AACGTCAATC 
AAAGTTAGGA GTACCGATGA CATGTATTAT TAGAAGAAAT 
GCGATTTGAT GATTGATGGG CGTCAAGTAG CAATTGTTTA 
AAAGAATGGG AAGCACGTGA GCGTATGGAA CTTTCCACCG 
TAAGAAGACC CAGCAGGTTC TTTCTGAAGA TGGAGTACTC 
GAGCTTCATT CGCAGGAATG TGGGCTTTGG AGAACACTGA 
.CCAGAAGCGT TTGTTCTGAA GCCACAAACT GAAGGTGGAG 
AGAACTTCCG GAAGAAGAGC GTGGAGCTTT CATTTTGATG 
TTGCAAAGAA GCCGATCACA TTTGCTAAGG CTGTTAGTGA 
CCTGAGCTTA AGACTGCTGG GCATTTGCTC CGAACGAAAC 
TGTTGTCGAC ACCCCATTCC TCTACGAATT TATTTGAttt 
aaaccatttg ctatttagat ctttttgtgt ttgtaaattt 




GCTCCAAGGC TCCCGCTCGA AGATGATAAG 
TGTCATGCGT CTATCAACCG ACAAGTTGAG 
CGAAAAATGT TTTTGAAGAA GCAGTTCATA 
TTTCTAATCG ATATTCATAA AAATGTCGTG 
AGCCCAAGGA CTCAAGCAAC CAGTCACTCT 
AATATGGACT GAAACAAATT GAAATAAACA 
ATCAGAGTTC TTAAAGCGTT GAACATTTCC 
CGCTGAAGCT TTATTCAAGG CCTGGTCCCA 
AAAATCAGAT TGATCAACGC CACGTGGAAT 
TTAACACAAT GCTATGAACA ATTATCATTG 
CTTCAGAGCA GGATACTCAC CTGATCATTA 
CTATCAAAAC TCCATGGATC GGGCTACAGG 
GAAAGATTCA TCGGAAAACC ACGAGAAGCT 
TGAAGTGACT ATGAAAGTCG TGGCTGGAGC 
CCGCATTGCA CACCGGTGAT GAGATGGTTC 
GAGAAACT6A AACCGATGAT TATTGAAAAC 
ACTTGGAGTG TATGGTTATG CATTTGGAAG 
CGGAATCCAC AGCTATGGGT GGAGTAGCCG 
cgaacataat cagaaaactc aacaaaaatg 
aatcattgta atttattgaa tgt 



CTCAACGAGC 
CAGCGAAGTT 
TTCAGAACCT 
AAAACTGATG 
CGCGATTCAA 
ATATCGCCTC 
GATGACGTCA 
CTTTTCGAAC 
ATGAACTTGA 
AATGATAGAA 
TCCATCTACA 
TGGCAAATAC 
CGCGATATTC 
TCAAAAACAT 
AAATGCTCCG 
TACCTGGTTC 
GAAGGATGCA 
CCGGACATGC 
ctgtgatatg 
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The F39B2.3 gene is located on chromosome L Regulatory sequences can be found e g in the 
region between 14 797 521 and 14 798 310. An exemplary sequence of this region feas' foUows: 

CATTTTGAAAGTGCCCiWiGTTGCTGGAACGCTGAAAATTGAAATTATTAACAAAGAAATTTGCTTTAAkA 

TCCGAAAAATCAAGAAAAAATCGATAATTTCGTCGGACAATCCGCCTGCTAGCACGGCTTGACGCTCGTT 

TGCCGCGCGCTCATTCGATTTGTGTGAGTGCCCAGTGGAGCGCGTTTGCTAAGGCTAACTGTGTAGTCCT 
CTCGGACAAGATCTGTGAACATTGAJATGAAACACTTGGGTTCAATjUVAATOVCA^ . 

TTTTGTTTGCGACCGAAAAAAAATTATAAAAATTGAATATT GGTTATCATC GTTTCAATCTTTOTTTTnn. 

-469 

ATTAAAGGCACAGCTGCTAAAAATTGTTTTTTTTTTTTCAATTTTGCTAAAAGAAAATCAATTTTCTGAT 
TTTTTGTTGAGTTCCCGTGCAAATCAATGTCCTAGCTTTTTAAAATTGTTTTTTGTTATGTAATTCTAAT 
CAAATTTTGTCGAATTTTCAGAGATTTTCTGCTAAAACACTAAAAATAGTCTAAAAGTCGATAATTTGAT ' 
AAAO^TTTACTCa^AACCTTTTACGGAAAAATGAAACS^AAAGTTGCAAAAATATAGTAATTTCGC 

CTGAACGCGTACTTAAAGGTACACGGTTTGATTCGGATTGGTCCCGCCACAAAGTGTTACCATAACATTT 

TTCTCGCTGCGAGACCCATCCGAATAAATCCGTGCGCCTAATOVGTGCGAGTACGCATTTCATATTACTG 
ATAAGTGCCATTTTTAGAACAAT6 
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Exemplary F39B2.3 sequences: 




MSKSICKSSM 
VGESVKNVKV 
SALMQIAAWR 
AVIGNRAETT 
NLWQIEH 



RAAWRRFGA PDVIEAVESD MPRLEKNQVL VRNYAAGVNP VDTYIRAGQY 
GDRVWYGSEA DSTAEYVAVN RPFELPEGVS FEEGASLGVP' YLTAYRALFH 
NIEAVGTAGS ADGIRFVKSL GARNVYNHSD KQYVSKMKND YPGGFNHIFE 
INARQLMVTE GAVYGVALGM SSEAELLDFG INIVSFLKET EFRPLINKLY 




GKLPNLPYVP 
LAGAKTGDVI 
MAAHTNLNTD 
RLEQLGLAHE 



GKDGAGFVEL 
LVHGASGGVG 
LGLLAPRGRV 
EIMNNKGAKG 




ATGAGCAAAT 
.CGT^GAGTGAT 
ATATTCGTGC 
GTGGGAGAAA 
TGCGGTGAAT 
CTTATCGTGC 
AGTGCACTGA 
GAAGAGTCTT 
GCT-TCAACCA 
GCAGTAATTG 
ATTGGGAATG 
CACTTATAAA 
AATCTTGTAG 




CGATTTGCAA 
ATGCCCAGGC 
TGGTCAGTAT 
GCGTTAAAAA 
CGACCATTCG 
ATTGTTTCAT 
TGCAGCTGGC 
GGTGCACGGA 
CATTTTCGAA 
GAAATCGCGC 
TCTTCCGAGG 
CAAATTGTAT 
TGCAAATCGA 



ATCAAGCATG 
TTGAAAAAAA 
GGAAAACTAC 
TGTGAAAGTC 
AGTTGCCGGA 
CTTGCTGGTG 
TGCCTGGAGG 
ATGTCTATAA 
ATGGCTGCTC 
CGAGACCACG 
CTGAGCTCTT 
CGTCTCGAGC 
ACATTAAttc 



CGCGCAGCTG 
CCAGGTTCTC 
CAAATCTTCC 
GGCGATCGAG 
AGGAGTTTCG 
CAAAGACTGG 
AACATTGAAG 
TCATTCGGAT 
ACACAAATCT 
ATCAACGCAA 
GGACTTTGGC 
AATTAGGACT 
attattttaa 



TAGTCCGACG 
GTTCGGAATT 
ATATGTACCA 
TCTGGTATGG 
TTTGAGGAAG 
CGACGTTATA 
CTGTTGGCAC- 
AAGCAATAT6 
GAACACGGAC 
GACAACTTAT 
ATCAAC2ATTG 
GGCTCATGAG 
cacgccattt 



ATTCGGAGCA 
ACGCTGCCGG 
GGAAAAGATG 
ATCAGAAGCG 
GAGCTTCTCT 
CTTGTACACG 
TGCTGGATCT 
TGTCGAAAAT 
CTC6GATTGC 
GGTTACAGAA 
TCTCATTCTT 
GAAATTATGA 
aaaggaa 



CCTGATGTCA 
TGTCAATCCA 
GAGCCGGATT 
GACAGTACAG 
CGGAGTGCCT 
GAGCATCTGG 
GCTGATG6GA 
GAAAAATGAT 
TGGCTCCACG 
GGAGCTGTTT 
GAAGGAAACC 
ACAACAAGGG 



TAGAAGCCGT 
GTTGACACAT 
CGTCGAACTT 
CAGAGTATGT 
TATCTTACCG 
TGGAGTGGGA 
TCCGGTTQGT 
TATCCAGGAG 
TGGTAGAGTT 
ACGGTGTAGC 
GAGTTTCGTC 
AGCGAAAGGA 
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The R03D7.6 gene is located on chromosome IL Regulatory sequences can be found eg in the 
region between 10949088 and 10950317. An exemplary sequence of this region is as foUows : 

. . . . • *^ ; 

■ • • • 

AGAACTTTTCGAGAAGTCTACCGTTGTAGTTTTCGAAATAGTAATTTATTTAGTGACGTTTATAAAGGTTTACATGATTT 
GGTTTGGAAATTTTTTAGGAGTTTATTCATAAAAACAAAGTAACCATGGACATTCCAGAAGTCTATAGTACACGCGATCC 

TACCGTACCCTTCAGTATTTCTATCAGATTGATAGCTTTCGGTAGTCAGGTACAGCCTAAAAAATTCCTGCTTOCCTTTT 
T.GCCTACATGTCTGCCTACCTTCAGTCATAATGCCTACATAATG . ■ ^^^'-^"'.^'-'-^^^^ 

-947 

• • • 

MniTTCCy^TTGAAACTTGCAGACAGAAATTCAAATGGCAAAAAGAAACAAACACCGAAAOiTTAATC^ 

• ■ 

CAT.TTCTTTTCATATCAGTTTTCCTGTCAAAGCACATTTCTGGAGTCTGTGTGTATTTTTTTGTGTCTTTATGTGATCGG 
TGTTGTGAAATTTGTAGTTGATGTTGATAACATACTTTTTTTTGAAACAAAAAGTGATTGATTAGGCTTGAATTCAGAGA 
TATGTTCGTGATACTTTGCGATTCTCGAGCCAAAAACACGGTATCCGGTCTCGACACGACAACTTTTTCGCAAAATACAA 
•GCTGATGTGCGCCTTGAAAGAGTACTGTAATTTCAACCTTTCGTTGTTGCGGAATTTTCATAGTTTCTCGTTCAAAATAT 
ATGTATTTATTAAACAAAAAACTAAAACAAAACAATTGAGAACAO^TAAATTGTGAAAAATCAATGAGACCACAGCAAAA 

AATTTTGTATCTAGAGTACTCTTTAAAGGCGCACATCGGTTCTTATTTTCAGCAAAAATGTCGCTTCGAGACCGGGTACC 
GTATTTTTTTTTGTGCAftAACTTTAGGTCTAGGTAATATTAAAAAAAAATTCCAO^Ai^CTAGAATCTAGAGCTTTCCAT 
TAAATTTTTT GATGACATTT GAAAATTCAT GATGATTTTT TTCCAACAATTTCPiAAATATr'rr'Tf^Ta.TTr.^r.r.Tg^TCC 

_ « " 

■ 

ACTGAATTCTCTTTCCGAAAGACCACCACAATTTCAGGGCTCCGCCCATTTCGTGGTTTGTAGCCTTCCCGACCCTRrCT 

TTTTGATGACAATTGTGAGAGAAGTGAGAGGTTCAGACACAAAAAGCGACGTGGTCGAATGA 

-149 



FIG. 11 



wo 2005/049790 



12/39 



PCT/US2004/019046 



Exemplaiy R03D7.6 (gst-5) sequences: 







MVSYKLTYFN GRGAGEVSRQ lEAYAGQQYE DNRVTolgiP^SSS^^r^'^^^^^^^^^^^™ 
GKTAWEEAQV NSLADQYKDY SSEARPYFYA VMGFGPgSve S^^^ KKLAQSHAIA RPLAREFKLN 

AIAQHSADLI AKGGDFSKFP ELKAHAEKIQ AIPQIKKMIE TRSSpr ^^FYGFLVN FLKASGSGFL VGDSLTWIDL 




ATGGTTTCCT 
ACAATACGAG 
CATTCCTCGA 
GGAAAAACCG 
TTTCTACGCT 
TCTACGGATT 
GCTATTGCCC 
GAAGATCCAG 
tgcaaataaa 



ACAAGTTGAC 
GATAATAGAG 
AGTCGACGGT 
CCTGGGAAGA 
GTCATGGGAT 
CTTGGTCAAC 
AACATTCAGC 
GCGATTCCAC 
tatttttttt 



CTACTTCAAT 
TCACTCAGGA 
AAGAAGCTTG 
GGCTCAAGTG 
TCGGTCCAGG 
TTCTTGAAGG 
TGATTTGATT 
AAATCAAGAA 
tttt 



GGACGTGGCG 
ACAATGGCCA 
CTCAATCCCA 
AACTCTCTTG 
AGACGTTGAA 
CTTCGGGATC 
GCCAAGGGAG 
ATGGATCGAG 



CTGGAGAAGT 
GCATTGAAA6 
CGCGATTGCT 
CCGATCAATA 
ACTTTGAAGA 
CGGATTCCTT 
GTGATTTCAG 
ACCCGTCCAG 



GTCTC6TCAG 
AAACCTGCGC 
CGTTTCTTGG 
CAAGGATTAT 
AAGACATCTT 
GTCGGAGACT 
CAAGTTCCCA 
TCACACCATT 



ATTTTCGCCT 
TGCTCCATTC 
CTCGTGAGTT 
TCAAGTGAGG 
CCTTCCAGCA 
CTTTGACCTG 
GAGCTCAAGG 
CTAAatagct 



ATGCCGGACA 
GGACAACTTC 
CAAGCTCAAC 
CTCGTCCATA 
TTTGAAAAGT 
GATTGACTTG 
CTCATGCCGA 
gtataaaatc 
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The F35E8.8 gene is located on chromosome V. Regulatory sequences can be found e g in the 
region between 15 917 841 and 15 918 925. An exemplary sequence from in or around iiis region 
is as follows: . 



TCTCATTCTCTTCAAGACATAACACAACGGGCTGACGACCATATCATCAACGACGATTTTTTAGGAACTG 
TACTTTATCTGTGTCTGACCAACACGTGTGAATGAAGTTTCAACTGGAAAATTTGTTTGAAACACTGCAA 
AGAATTTCGAATTTT GATGATAAT TTTAAATGCCATTATCARTTTTanTnrrirr-an.p/^TASTCTTTCATT 



-240 



CTTTGCACACACACACACACACACACACACACACACACACTCACAAACACGCCTGAAATTTCGCAATAiPG 
CTGATTTAACGAGAAAACATTT GATGACAATA AACTTGGCGTATTAaTATAaanr:nr;aaaaTn.r^A^TT?n 



-94 



GATTCTCAACGGTTTATTTTCTGTCACAACTCTTCCTAATATTCACCATGGTTT 



FIG. 13 



wo 2005/049790 



PCT/US2004/019046 



14/39 



Exemplary F35E8.8 (gst-38) sequences: 





MVSYKLTYFD GRGAGEI.CRQ IFAAAEQKYE DNRLTDEEWE KfraAGOTPYNQ^^ 

GKSRWEEAQV NSIADQYKDY YAEARPYIAV KLGYTEGDAE ALYTSVYLPV ETOOIYGFFVN AL^GSrpT u^^^™ 
LVAQHSADLL GREKSDLEWD VPEMKAHSEK VQSIPQIKKW lETRPASDW "'^''^^^ ALKASGSGFL VGNSLTFIDL 




ATGGTTTCGT 
GAAATATGAA 
CAATGCTCGA 
GGAAAGAGCA 
CCTCGCTGTG 
ACTATGGATT 
CTTGTTGCTC 
TTCCGAAAAA 



ACAAGCTTAC 
GATAACAGAC 
GGTAGATGGC 
GATGGGAAGA 
AAGCTTGGTT 
CTTTGTCAAT 
AGCATTCAGC 
GTTCAGTCAA 



CTACTTCGAT 
TTACCGATGA 
AAACCACTC6 
AGCTCAAGTC 
ACACAGAAGG 
GCTTT6AAGG 
TGATTTGCTG 
TTCCTCAGAT 



GGACGCGGAG 
6GAGTGGGA6 
CTCAGTCCCA 
AACTCCTTGG 
AGACGCGGAG 
CCAGCGGGTC 
GGACGTGAAA 
CAAGAAATGG 



CCGGA6AGCT 
AAGTTCAAAG 
CGCGATGGCT 
CCGACCAGTA 
GCTCTTTACA 
AGGATTCTTG 
AGTCGGATCT 
ATTGAGACTC 



CTGCCGTCAA 
CGGCCGGAAA 
CGTTATCTTG 
CAAAGACTAT 
CAAGCGTCTA 
GTTGGAAATT 
TTTCAATGAT. 
GTCGAGCGAG 



ATCTTTGCTG. 

AACCCCATAC 

CTGGGGAATT 

TACGCGGAGG 

TCTTCCAGTT 

CCTTGACTTT 

GTCCCAGAGA 

TGACT6GTAA 



CCGCCGAGCA 
AACCAGCTTC 
CGGGTTGAAC 
CTCGTCCATA 
TTCAAGAAAC 
TATTGATTTG 
TGAAGGCAC^V 
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Naned genes 
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glutathione S-transferase; gst-6 
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The Fl 1 Gl 1 .2 gene is located on chromosome L Regulatory sequences can he fn»n^ « „ • 
region between 4 880 968 and 4 882 068 An exemifiaJ cZ, • ^ 
foUows: ^ oe^ An exemplary sequence in or around this region is as 

• ■ 

TTCiW^GCACAAAACCAATCACGTCATAATGTTSJS?????^?^^ 



-391 



ggtgagtggagagacgacagacAttJS^SS^^^ctS?^^''^^^^^^^ 



-133 



gagacggccggcattcagtgttcaacccttctcatcgaccactcgatttct^^^^ 
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Exemplary Fll Gil. 2 (gst-7) sequences: 



MVHYKVSYFP IRGAGEIARQ IIAYAGQDFE. DNRIPKEEWP A\^ PStXf^ ^ 

CAWEEAQVNS VADQFKDYUJ EVRPYBMVKM GFAEgSdAL SvSpgS ^vJ^v^^^ LAQSHAIARY IARQEX31NGK 
AQHTADLLAA NAALLDEFPQ FKAHQEKVHS NANiKKwS ^^^^^^^I" ' KSAGSGYLVG " DSLTFVDLLV 




cgaccactcg 
AGATTGCTCG 
AAGCCAAGCA 
TTACTTGGCT 
AGGATTACCT 
GACGTCTTCC 
TGGAGACTCT 
ATGAATTCCC 
GTTACTCCAT 



atttcttgct 
TCAGATCTTG 
CTCCATTCGG 
CGTCAGTTCG 
CAACGAAGTT 
TTCCAGGATT 
TTGACCTTTG 
ACAATTCAAG 
TCTAAatgat 



tggttatttc 
GCCTACGCTG 
ACAGCTTCCA 
GAATCAATGG 
CGTCCATACT 
CAAGAAGCAC 
TCGACTTGCT 
GCTCATCAGG 
ttcca 



aacaATGGTC 
GACAAGACTT 
CTCCTTGAAG 
AAAGTGTGCA 
TCATGGTGAA 
TATGGATTCT 
CGTCGCTCAG 
.AAAAGGTTCA 



CACTACAA6G 
CGAGGACAAC 
TTGACGGAAA 
TGGGAGGAGG 
GATGGGATTT 
TTGCTAACTT 
CACACTGCTG 
CTCGAATGCC 



TATCGTACTT 
AGAATCCCAA 
GGTTCTTGCC 
CTCAAGTCAA 
GCTGAAGGAG- 
CCTCAAGTCG 
ATCTTCTGGC 
AACATCAAGA 



CCCAATTCGT 
AGGAGGAATG 
CAATCTCATG 
CTCGGTTGCT 
ATCTCGATGC 
GCTGGATCCG 
TGCCAACGCA 
AGTGGTTGGA 



GGAGCTGGAG 
GCCAGCTGTC 
CTATCGCCCG 
GATCAATTCA 
TCTTGCCAAG 
GATACTTGGT 
GCTCTTCTCG 
GACTCGTCCA 
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The K08F4.7 gene is located on chromosome IV. Regulatory sequences can be found e.g in the 
fXws: ^^^^Pl^^s^l'^^ceoftMsi^gionS '^ 

ATTATCa^AAAAGATTAGAAGTTGGCAAACCTTGGG^ 

• CCAACTTTATCCAATCTTlTACTAAAArrATCCraAAGACTATrrAAATTT^^^^ 
CACTTC3GATATGACTTATAGTOAGCCTAACCTC3AAGCTATTGCTTGCTTOAT^^ 

TAACCAGCTCCAATAGTTGTTATTTTTGC TTTTGTCATC ATTTT a^ittatcgcTTTGCTACTTGGA 

-157 

TCOlCGATTTACACTCTCAAGTGAAACa^CTGTTCTTTGATGCCA^ 

-83 

AAATATATATAAACTGOAATTAAAAACAATTGATACATCGATTCAATTACTGAATTCTAATTATO 
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Exemplary K08F4.7 (gst-4) sequences: 




MPNYKLLYFD ARALAEPIRI MFAMLNVPYE DYRVSVEEWS KIJCPTTPE^ 

TAEEEAYADS IVDQYRDFIF FFRQFTSSVF • YGSDADHINK SSvV^^ mF^S^^NK? f^^f^^f ^^^^ lARKFGLAGK 
lADNLTSLLK NGFLDEWKEK KLEEFYNKIH SIPE^JSS DDFLAIINKF LAKSKSGFLV GDSL^WADIV 




ATGCCAAACT 
GCCTTACGAG 
TACAAGTCGA 
ACTGCAGAGG 
TTCCGTTTTC 
TGGCAATAAT 
ATTGCTGACA 
CAAGATTCAT 
tctgaattat 



ATAAGCTATT 
GATTATAGAG 
TGGAGAACAA 
AAGAAGCTTA 
TATGGAAGTG 
CAATAAGTTC 
ATTTGACAAG 
TCAATTCCAG 
gtatgtagta 



GTATTTTGAT 
TTTCAGTGGA 
TTCGGTCAGT 
CGCTGATTCA 
ACGCTGATCA 
CTGGCCTVAGA 
TCTCCTGAAG 
AAATTAAGAA 
aaataatatc 



GCTCGTGCTC 
AGAATGGTCA 
CAATGTCTAT 
ATTGTAGAT.C 
TATTAACAAA 
GTAAATCAGG 
AATGGATTCT 
TTACGTGGCA 

gttcctat'ca 



TTGCTGAGCC 
AAGCTGAAGC 
CACAAGATAC 
AATACAGAGA 
GTACGTTTTG 
ATTCCTCGTT 
TAGATTTCAA 
ACAAGAAAGG 
cgtctcccag 



AATCCGTATC 
CAACGACTCC 
TTGGCAAGAA 
TTTCATATTC 
AAGTTGTTGA 
GGAGACTCAT 
CAAAGAAAAG 
ATAGTATTGT 
agagcgtaat 



ATGTTTGCAA 
ATTTGGCCAG 
AATTTGGAGT 
TTTTTCCGTC 
ACCAGCCCGT 
TGACTTGGGC 
AA6TT6GAAG 
TTAAaatcga 
aaattattat 



TGCTCAATGT 
CTTCCCATTT 
CGCTGGAAAA 
AATTCACTTC 
GATGATTTCT 
TGATATTGTG 
AGTTCTATAA 
attatttaag 
tatgtg 
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The sod-lg&ae is located on chromosome H. Regulatory sequences can be found e b in ttie 
between about 6 973 806 and about 6 974 406. An exempSy sequence of fSo^: 



tcttcatatcttgtttctggaaacgaaaatttccaacatgaaagaScccgaJS^^ 



GTTTGATGCGTGCACyvCTATTTTGCTGTTGTTTTTTTCTTTTTTCTcVAAATTTTCTT^^^ 

aaaaatcctttttcttgaaataatcgttouvtttcgagttttcaagagtgS 
cgcttattttttctgtgtttttgttttgtggtttttaatcagtgtcWt^a^^ 

-64 

TTATTATTCAAAGTTGTAGATTCAGTATTTTAGATCGGTGA- 



T6 
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Exemplary sod-1 sequences: 




_ — — ---^-^-^^^§1^ 

ME^LLTQVS NAIFPQVEAA QKMSNRAVAV I^GETVTGTI WITQKSENDQ AVIEGEIKGL^^^^^^^?^ 

^'^""^ AGADGVAKXK I.TDTI.VTI.VG PNTWGKSmJ ^SS^ SJgSsK 




tttagatcgg 
GTCGAACCGT 
AGGCAGTTAT 
AACGGATGCA 
AGGCGATCTA 
GTCCAAACAC 
GAAGAGTCCA 
tgaatcgcgt 
atcccgttgt 



tgATGTTTAT 
GCTGTCGCTG 
TGAAGGAGAA 
TTTCTGCCGG 
GGAAATGTGG 
TGTCGTTGGC 
AGAA6ACTG6 
ctctgaatct 
tcctactcct 



GAATCTTCTC 
TTCTTCGTGG 
ATCAAGGGAC 
TCCACACTTC 
AAGCTGGAGC 
CGATCTATGG 
AAACGCCGGA 
ccacacaatt 
actactgtat 



ACTCAGGTCT 
AGAAACTGTT 
TTACTCCCGG 
AATCCATTTG 
CGATGGAGT(3 
TTGTTCATGC 
GCTCGTGCTG 
cctactaaag 
attttcacat 



CCAACGCGAT 
ACCGGTACTA 
TCTTCATGGA 
GAAA6ACTCA 
GCAAAAATCA 
CG6ACAAGAC 
CCTGCGGTGT 
acaatttttc 
aaaatttctt 



TTTTCCGCAG 
TCTGGATCAC 
TTCCACGTTC 
TGGTGGACCA 
AGCTCACCGA 
GACCTCGGCG 
CATTGCTCTC 
atttcttgct 
caaaatttca 



GTCGAAGCCG 
ACAGAAGTCC 
ACCAATATGG 
AAATCCGAGA 
CACGCTCGTC 
AGGGAGTCGG 
GGTGCTCCCC 
ttgtcgttat 
aataaaggtt 



CTCAAAAAAT 
GAAAATGACC 
TGATTCCACC 
TCGGTCACGT 
ACGCTTTACG 
AGACAAGGCA 
AGTGActacc 
attcttaaga 
gtagtttc 
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Haned genes 



Gene Kodels 

F02E9.9 

■^E^ ^ 

ttO donain, GHaeta repeat 

Operons 

Y flte tt Cosnlds 
FQ2E9 



FlOOll.l e4Mvi« 




FIODU 



The 5o</-2 gene is iQcated oh chromosome 1. Reeulatorv seauenrp.: ran ^ 

bCw.™ *ou. 8 441 038 and 8 442 612. A. ^^oTS^'r^^f^SZT"^ 




^ ICGTTGCA 

95 9 

CCAACGGACATCAAftGTATCAAAGTAAG 
AAATTAAGTGAGAATTGAAGGATTGCT' 
-363 ^^*-*"TATTAGACTACCATAAATATTACGATAAT 



CCGACCGTGAAAAATGGAAAGAGTTTGTCCGCAGTTGCCTCGCAACTTO 

taagtaagtaagtaacctgaataaaaacgttgcaattaSS^tcS???^S^ 

TCCGA^GAGAAAATGAC^TATAGGGTATACTAA^ 



-191 



• ■ 
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^cemplary sod-2 sequ^ces: 




MLQNTVRCVS KLVQPITGVA AVRSKHSLPoiiFSrpYaSi^^ 

AIALQPALKF NGGGHINHSI EWTNUVKDGG EPSAEHTAI KSD^SLDNL QKQLSAST^ VosicwS TCPKGK?^ 
ATCANQDPLE ATTGLVPLFG IDVWEHAYYL QYKNVRPDYV NAIMKIANWK NVSERFAKAQ q^^^'^"^'^ YCPKGKILKV 




tttgcagccg 
GCTCGAAGCA 
CATCATCAAA 
AAACGTCAAA 
CTAATTTGGC 
CTTCAAAAAC 
GATCTTGAAG 
TCTGGGAGCA 
AA6AACGTCA 
gaagttgact 
aaagttttca 



aaaATGCTTC 
CTCGCTGCCA 
AGCATCATGC 
GAAGCTATCG 
AAAGGACGGA 
AGCTTTCGGC 
GTTGCCACAT 
CGCTTACTAC 
GCGAGCGTTT 
cccattgttt 
ccg 



AAAACACCGT 
GATTTACCAT 
CACTTATGTG 
CTCTTCAGCC 
GGAGAACCAT 
ATCAACTGTC 
GTGCCAATCA 
TTGCAGTACA 
TGCAAAGGCA 
tgtaactatt 



TCGCTGTGTC 
ACGACTATGC 
AACAATCTCA 
AGCTCTCAAG 
CGGCGGAGTT 
GCTGTTCAAG 
GGATCCACTT 
AGAATGTTCG 
CAGCAATAAa 
tttgtttctt 



TCAAAGCTTG 
TGATTTGGAG 
ACCAAATTGA 
TTCAATGGA6 
GCTCACCGCA 
GATCAGGATG 
GAGGCAACAA 
ACCAGATTAT 
tgagctgaat 
aattatttcg 



TTCAACCGAT 

CCTGTAATCA 

GGAAAAGCTT 

GAGGACATAT 

ATTAAGAGCG 

6GGATGGTTG' 

CTGGACTTGT 

GTCAATGCTA 

cacaagaatt 

aaatgtaaat 



CACAGGAGTC 
GTCACGAGAT 
CACGAGGCG6 
CAACCACTCC 
ACTTC6GATC 
GGATACTGTC 
TCCACTGTTC 
TTTGGAAGAT 
aatcgtcaaa 
tttcaaacct 



GCTGCTGTTC 
TATGCAACTT 
TCTCCAAAGG 
ATCTTCTGGA 
TCTGGATAAT 
CAAAGGGAAA 
GGAATTGACG 
CGCCAACTGG 
tgtagcagta 
tttcaaatga 
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'^tiS^l'f chromosome H. Regulatory sequences can be found e.g., in the region 

near 14 306 135. An exenq)lary sequence of this region is as follows: 

AAAAAAATU^TCGATAAAAAATCCGCGTCAACGAAAGTTTAAAGTTACAGTATTTGTCGTTTCGAtSACrrr 
GTACCGTAGTTTTTGGTGAAAACATTGCAAAATTTGGTCAACAATTTCATCGCTGCGAGACCGACACAAr 
ACTTTATTTTATTTTTGGGTTT&CCTTATCGCTTATCATAAACATGTGA CGTCATCATC TCT TGTACAGA 

-997 -978 

GCACCGCGACTGGGAGTATAAGAATCGCCGGAAAACATCAATAATCAGTTCGGTAGAAGTGAAAATTrar 

CGTAAAATMGATCMTTTTCGATGCACCATATTTGACGCGC^ATACTTCTAbuiGCCGCT^ 
—880 

• . 

TCGTGGACAACTTTGGATTATTTTTTGTTTTTAAAATTCAAAATAGTCAATATATTGCTTATTTATAGcr 

CGCCTTTTTGACAGTAAGTTTGTCAAATTTGCGCGTAAGTTATGGTGTTTGCACATATGCACCATA^rr 

AACACCCCGCGGCCCGGCTAGTGGTACATCCATGCAAATGCGCTCTACTGATAATTTGAGTTTAACCAGG " 

TTTAGGCGCAAGATAAGAAAAAAGCTTTGGACCAAAAAATTTAGAGTTTATTTTTTTCGGACATTTTTTa 

TATACATCACAAAAATATTGGGCCACTCGTTTTTGATAAAAACGACAAGCCCAAAAGTTCAGGTATACGG 

TAGACAAATTGCGTACAGGTACCACTTTTCCACGTAGTGCCAGGTTGTCCCATTACGCTTTGATCTATrA • 

AAAATGCGGGAATTTTTCGTCCAGAAAAATGTGACGTOVGCACGTTCTCAACCATGCGAAATCAGTTGAA- 

aactctgcgtctattctcccgcattttttgtagatctgtagatttgtagatcaatccattccccctataS 

CCTGACCCATAATCAATACCTACCTAATTTTTGTCTTTCCCCCTACTTTTTTGCCTGTCCAAAATMrrr 

agactatgccgtagtctggtgtccaacaacatgttccttatcagtgataacgctacaatct^ttcttt' 

TTTCTCTGTTTCTCTTGTCTCTCCCAACCCATATTCCGTATTACACCTCGTCGTGGTCATTTTTTTGTTP 
AGAGTTTTATTTAATTCTAAATTTCCTAACTAAAATTTCAGA AiiiAitiiii. 
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Exemplary ctl-1 sequences: 




MPNDPSDNQL 

FEVTHDITKY 

IHALKRNPQT. 

LDPTDAGKIiA 

YFAEVEQAAF 

PNSFRGYRTR 

HPDFGNALRH 



KTYKETYPKP 

CKADMraBeVG 

HMRDPNALFD 

SSDPDYAIRD. 

CPAHIVPGIE 

DDVKESTFQT 

QLCQKKH 



QVITTSNGAP 
KQTPLLVRFS 
FWMNRPESIH 
LFNAIESRNF 
FSPDB^MLQGR 
TGDVDRYETG 




lYSKTAVLTA 
TVAGESGSAD 
QVMFLYSDRG 
PEWKMFIQV^5 
IFSYTDTHYH 
DDHNYEQPRQ 



GRRGPMLMQD 
TVRDPRGFSL 
IPDGFRFMNG 
TFEQAEKWEF 
RLGPNYIQLP 
FWEKVLKEEE 



WYMDEMAHF 
KFYTEEGNWD 
YGAHTFKMVN 
NPFDVTKVWP 

VNCPYRSRAH 
RDRLV6NLAS 



DRERIPERW 
liVGNNTPIFF 
KEGNPIYCKF 
HGDYPLIBV6 
TTQROGAMAY 
DLGGCLEEIQ 



HAKGAGAH6Y 
IRDAXHFPNF 
HFKPAQ6SKN 
KMVIiNRMVKN 
ESQGDAPNYF 
NGMVKEFTKV 




CTGAAAACCT 
CGTGCTCACC 
AACGTATCCC 
TACTGTAAGG 
GGGATCCGCT 
GAAATAACAC 
ACTCACATGA 
CTCGGATCGT 
GAAATCCGAT 
GCCTCTTCGG 
CATTCAAGTG 
ATTACCCACT 
TTCTGCCCGG 
CACGCATTAC 
AACGCGATGG 
CGTGATGATG 
GCAGCCACGT 
GTGGCTGTTT- 
CATCAGCTCT 



ACAAGGAGAC 
GCCGGGCGGC 
CGAGCGTGTC 
CCGATATGTT 
GATACTGTCC 
TCCGATCTTC 
GGGATCCGAA 
GGAATTCCTG 
TTATTGTAAA 
ATCCAGACTA 
ATGACATTCG 
GATCGAGGTC 
CCCACATCGT 
CATCGCCTTG 
TGCAATGGCT 
TGAAGGAGTC 
CAGTTCTGGG 
GGAGGAAATT 
GCCAGAAGAA 



GTATCCAAAA 
GTGGCCCAAT 
GTTCATGCCA 
CAACAAGGTC 
GCGATCCACG 
TTCATTCGTG 
TGCGCTCTTC 
ATGGATTCCG 
TTCCATTTCA 
TGCGATCCGC 
AACAAGCTGA 
GGCAAGATGG 
CCCAGGAATC 
GACCAAACTA 
TATGAAAGCC 
GACATTTCAG 
AGAAA6TGCT 
CAAAATG6AA 
GCATTAAatt 



CCCCAAGTGA 
GCTCATGCAA 
AGGGAGCCGG 
GGAAAACAGA 
TGGATTCTCT 
ACGCAATCCA 
GATTTCTGGA 
TTTTATGAAT 
AGCCTGCTCA 
GACCTGTTCA 
GAAAT6GGAG 
TGCTGAACAG 
GAGTTCTCGC 
CATTCAGCTT 
AGGGAGATGC 
ACGACTGGAG 
CAAGGAGGAG 
TGGTCAAAGA 



TCACAACTTC 
GATGTAGTTT 
AGCCCATGGA 
CACCACTTCT 
CTCAAATTCT 
CTTTCCGAAT 
TGAATCGCCC 
GGATACGGAG 
AG6TTCCAAG 
ATGCCATTGA 
TTCAATCCAT 
GAATGTGAAG 
CAGACAA6AT 
CCAGTCAACT 
6CCGAATTAC 
ATGTTGATCG 
GAGAGAGATC 
GTTCACGAAA 



aaaATGCCAA 
AAATGGAGCT 
ATATGGATGA 
TACTTCGAGG 
CGTTCGTTTT 
ATACCGAGGA 
TTCATTCATG 
TGAATCCATT 
CGCATACTTT 
AATCTCGATC 
GTCAAGAAAT 
TTGATGTCAC 
AATTATTTCG 
GCTCCAAGGG 
GCCCGTACCG 
TTCCCGAACA 
TTATGAGACT 
GGCTCGTTGG 
GTTCATCCGG 



AC6ATCCATC 
CCGATCTACT 
GATGGCTCAT 
TCACCCATGA 
TCAACGGTCG 
6GGTAACTG6 
CCCTGAAGC6 
CATCA6GTGA 
CAAGATGGTC 
CAACTGACGC 
TTCCCGGAAT 
TAAAGTTTGG 
CTGAGGTCGA 
CGTATCTTCT 
CTCCCGTGCT 
GTTTCCGCGG 
GGAGACGATC 
GAATTTGGCT 
ATTTCGGAAA 



GGATAATCAA 

CGAAGACCGC 

TTCGATCGTG 

CATCACCAAG 

CTGGAGAATC 

GATCTGGTTG 

CAATCGAGAG 

TGTTCCTCTA 

AACAAGGAGG 

TGGAAAGCTC 

GGAAGATGTT 

CCACACGGTG- 

ACAAGCCGCC 

CCTACACGGA 

CATACCACTC 

ATACCGTACT 

ACAACTACGA 

AGTGATTTGG . 

TGCTCTTCGC 
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Naned genes 



Gene Hodels 
C08f)9.9 



Operons 

YRCs ft Cosnids 
4 — , 




superoxide disnutase; sod-^ 



The sod-3 gene is. located on chromosome X. Exemplary regulatory sequences include: 

TflOTCGOVGAAAAAAGTCGTTGCAAAaVTTOTT^ 

ttgaaagtgaaaaatatttqcttaaaacttccaaatttaaatctgcagtgattSga 

ATTTTCAAAAACATTCAATGTTTTCCCTa^GAGTGACTATGC^^ 
TGGATGCCCTGATAAAAAGTAGGTGAAATTTCQOIGGQQAACATCATATTA^^ 

GAAATGGAAATGTTTGTCGGTGGTATGCTCGAATATTTOAGATAITATATATTTA^^ 
TTTTTGACAAACGGAAAAAATTTGTGTC^ 

CTTATAAAAACTGTTTGACTATCTTATTTCAGGAAAAAAAAATCCAAGAATAAACAT^ 
AACTTTCTAATGGCTGATTAATAAAACAAAGTTATACAACTATTC^ 



TCITCTGTTTTTTTTTGAATATTTCATC^ 
ACAATTTTlXS^CCaAAAATTGACCTTTGAOT^^ 
TOAOTGCTGTTGATGAGGTCTTTGATC^TTTGTATTGTT^ 



M|2CMerAATATATTGT<^CAACTTCTTGTTTTTTTTTTa^^ 

« o / 



OTAAOVAAGGTTCACACAAaVACTCTCCTCTCCATCTCTTTCTCTa^^ 
!;^!!!^S'''^^°°^™^™^°^°'rTTTCAAGATTTTTGGTCTC^^ 
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MLQSTARTAS KLVQPVA6VL AVRSKHTLPD LPEDYADLEP VISHEIMQLHHQliimTY\^ 

AIALQPALKF NGGGHINHSI FWTNLAKDGG EPSKELMDTI KRDFGSLDNL Q^LSDITIA VQGsSSS TC^^SJJSS 
ATCANQDPLB GMVPLFGIDV WEHAYYLQYK NVRPDYVHAI WKIANWKNIS ERFANARQ YCKKDKILKI 




cgtttgtcaa 
TTGCGGGAGT 
AGCCATGAAA 
TCACGAGGCT 
TCAAtCATTC 
GACTTCGGTT 
GGGATATTGC 
TTGGAATTGA 
ATTGCCAACT 
acgaaaatat 
cacagttttt 



atcttctaat 
TCTCGCCGTC 
TCATGCAGCT 
GTTTCGAAAG 
TATCTTCTGG 
CCCTGGATAA 
AAGAAAGACA 
CGTTTGGGAG 
GGAA6AATAT 
tgaagataat 
tctta 



tttcagtgaa 
C6CTCCAAGC 
TCATCATCAA 
GGAATCTAAA 
ACCAACTTGG 
CTTGCAAAAA 
AAATCTTGAA 
CACGCCTACT 
CAGCGAGAGA 
tcagatgtag 



taaaATGCTG 
ACACTCTCCC 
AAGCATCAT6 
AGAAGCAATT 
CTAAGGATGG 
CGTCTTTCTG 
GATCGCCACC 
ACTTGCAGTA 
TTTGCCAATG 
tttaaaacgc 



CAATCTACTG 
AGATCTCCCA 
CCACCTACGT 
GCTCTCCAAC 
TGGAGAACCT 
ACATCACTAT 
TGTGCAAACC 
CAAAAATGTC 
CTCGACAATA 
tgagaatttg 



CTCGCACTGC 
TTCGACTATG 
GAACT^TCTC 
CAGCGCTGAA 
TCAAAGGAGC 
TGCGGTTCAA 
AGGATCCTTT 
CGCCCAGACT 
Aaagcaggaa 
tatttttgta. 



TTCAAAGCTT 
CAGATTTGGA 
AATCAGATCG 
ATTCAATGGT 
TGATGGACAC 
GGCTCTGGCT 
GGAAGGAATG 
ATGTCCATGC 
atattggaat 
attgtttaaa 



GTTCAACCGG 
ACCTGTAATC 
AGGAGAAACT 
GGTGGACACA 
TATTAAGCGC 
GGGGATGGTT 
GTCCGACTTT 
TATTTGGAAG 
tttcggtttt 
taaaagaacg 
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Gene Models 



Operons 

YflCs a Cosnids 
F37B12 



r37B12.2 



F37B15 




•The^ca-igeneislocatedonchromosoinen. Anexemplaiyregulatorysequenceis: 

— 607 

^316 

atatcactt^cm<^tgagat™tgtticctttto»ttitSJ^SSS™SS 



-121 



ACTGATGTAACTTTCCTTCTAATGTTATCATTTGTATTTTTTTGCAGAA' 



TG 
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Exemplary gcs-1 sequences: 




MGLLTKGSPL 
EEQVNAMLGT 
EVAADRKNDD 
lYMDHMGFGM 
PLKNSKWVID 
EHFETIQSSN' 
QKDAVLHQKF 
QYLNFISKRA 
AEEHMXVSSQ 



TWAETVPHID 
ANRFLWRPEF 
AANSVFWPEQ 
GCCCLQVTFQ 
KSRYDSTDCY 
WMNMRFKPPP 
LFRKGLAECK 
TGEINTLAHW 
KRAH 



YIKKHGIAQF 
GSYMIEGTPG 
AVFLGHPRFK 
AVNVDEARWL 
lYPCSVGYND 
PDAPEIGWRV 
SAPENLKGSE 
TRGFVQSHPA 



IHLYHRLKSR 
MPYGGLIACF 
NLTKNXKGRR 
YDQLTPITPI 
IPLQYDETIY 
EFRPTEVQLT 
KCGPPSQDIE 
YKHDSDVNDN 



HGDQLKW6DE 
NIVEANMKLR 
GSKVAINVPI 
LLALSAATPI 
KQLIDGNIDE 
DFENAAYCCF 
EMSIDEIING 
IVYDLLKKMD 



lEYTIVKFDD 
RQWKKLLKK 
FKDTNTPSPF 
FRGKLSNVDS 
PLAKHIAHMF 
WLLTRMMIS 
KKNGFPGLIS 
AXSNGEDHCE 



ANKKVRVSCK 

DETCLSISFP 

VEDLSALGGP. 

RWDIISASVD 

IRDPHQVFRE 

FRLTYXiMPIS 

LIRQFLDSAD 

KLLGCYRSKT 



AEELXiNKLQA 
SLGVPGFTFP 
DDTRDAKPDH 
DRTPEERGLE 
RIEQDDEKSS 
MVTENMKRAQ 
VDVDTRCTIS 
DHAISAAVRK 




tttgcagsiAT 
CACGGAATTG 
TGAATACACT 
"TACAAGCCGA 
ATCGAGGGTVA 
ACAGGTCGTC 
CATTCCCGGA 
GGCCATCCAC 
CAAGGATACG 
CTGATCACAT. 
GATGAAGCCA 
CCGTGGAAAA 
GATTGGAACC 
TCTGTTGGCT 
ACTGGCAAAA 
AAAGCAGTGA 
GAAATGGGAT 
TGTATTGCTC 
GTGCTCAjGCA 
AATTTGAAAG 
GAAA/^TGGA 
CGATTTCTCA 
GTACAATCTC 
CATCTCAAAC 
TTCGCAAAGC 
aatttattta 
tttagttggc 
aaatttaaac 
tttcatattt 



GGGTCTTTTG 

ctcaattcat 
attgtaaaat 
agagcaggtg 
cccccggaat 
aaaaagttat 
agtagcagct 
gtttcaagaa 
aacaccccca 
ttatatggJat 

GATGGTTGTA 

TTATCCAATG 

TCTCAAGAAT 

ACAATGATAT 

CATATTGCGC 

ACACTTTGAA 

GGAGAGTCGA 

ACCAGAATGA 

AAAAGATGCA 

GATCGGAGAA 

TTCCCAGGTC 

ATATTTGAAC 

ATCCTGCATA 

GGAGAAGATC 

TGAAGAGCAC 

tgttctatac 

catctacatt 

tctgaatttc 

atgtacaatg 



ACG7\AAGGTA 
CAATCTCTAC 
TTGATGACGC 
AATGC6ATGC 
GCCTTACGGA 
TAAAGAAGGA 
GATAGAAAGA 
TCTTACCAAA 
GTCCATTCGT 
CATATGGGAT 
CGATCAGCTG 
TCGATTCTAG 
TCGAAATGGG 
TCCTCTTCAA 
ATATGTTCAT 
ACAATTCAAT 
ATTCCGGCCA 
TGATCTCCTT 
GTTCTCAATC 
ATGTGGACCA 
TCATTTCACT 
TTTATTTCGA 
CAAACATGAC 
ACTGTGAGAA 
ATGATCGTGT 
gtcgtgtttc 
aaattactga 
taaatgcttg 
ctcccctgaa 



GTCCGTTGAC 
CATCGTCTGA 
AAACAAGAAA 
TTGGAACTGC 
GGTCTCATCG 
TGAAACATGT 
ATGATGATGC 
AATATTAAAG 
TGAAGATTTA 
TCGGAATGGG 
ACACCGATTA. 
ATGGGATATC 
TTATTGATAA 
TACGACGAAA 
TCGTGATCCA 
CATCGAATTG 
ACTGAAGTTC 
CAGGCTGACA 
AGAAATTCCT 
CCTAGTCAA6 
TATTCGCCAA 
AACGA6CAAC 
AGTGATGTAA 
GCTGCTCGGA 
CCA6CCAAAA 
ccattccttc 
tatgttgatg 
ttttttgagt 
tccatcatat 



GTGGGCAGAA 
AATCAAGACA 
GTTCGCGTGT* 
CAATCGATTC 
CTTGCTTCAA 
CTATCGATAT 
AGCTAATAGC 
GTCGCAGAGG 
TCTGCACTTG 
GTGCTGTTGT 
CACCGATTCT 
ATTAGTGCAA 
GAGTCGCTAC 
CCATATATAA 
CATCAAGTTT 
GATGAACATG 
AACTGACC6A 
TATTTGATGC 
GTTCAGAAAA 
ATATTGAAGA 
TTTCTAGATT 
TGGAGAGATT 
ATGATAATAT 
TGCTACCGCT 
ACGTGCACAT 
taggccttcc 
ctattttcta 
agtaggaatc 
aattattatc 



ACCGTACCGC 
CGGAGATCAA 
CGTGCAAAGC 
CTTTGGAGAC 
CATTGTCGAG 
CGTTCCCATC 
GTTTTCTGGC 
AAGTAAAGTA 
GAGGTCCTGA 
CTTCAAGTCA 
ACTGGCACTC 
GTGTCGACGA 
GACTCCACGG 
ACAACTAATT 
TCC6TGAGCG 
CGATTCAAGC 
CTTTGAAAAT 
CAATTTCAAT 
GGATT6GCT6 
AATGTCdATT 
CTGCTGATGT 
AATACTTTGG 
AGTTTACGAT 
CTAAAACCGA 
TAGgcgataa 
atgattcaca 
gtaagcagat 
agtacgaatg 
cgtgttg 



ACATTGATTA 

TTGAAATGGG 

TGAAGAGCTT 

CAGAATTCGG 

GCAAACATGA 

TCTTGGAGTA 

CAGAACAAGC 

GCTATCAACG 

TGATACTCGT 

CTTTCCAGGC 

TCTGCCGCCA 

TCGTACACCG 

ACTGTTACAT 

GATGGAAATA 

TATCGAACAG 

CACCACCACC 

GCAGCATACT 

GCaTTACTGAA 

AGTGCAAATC 

GATGAGATTA 

TGATGTGGAT 

CTCACTGGAC 

CTTTTGAAAA 

TCATGCCATT 

ttgattgatt 

atttttcgat. 

gtcagtgttt 

gtacattaat 



TATCAAGAAG 
GAGATGAGAT 
CTTAATAAGT 
ATCGTACATG 
AATTGCGCAG 
CCTGGATTCA 
TGTATTCTTG 
TCCCGATATT 
GATGCGAAAC 
TGTGAAC6TC 
CACCAATCTT 
GAGGAAAGA6 
TTATCCATGT 
TTGATGAGCC 
GACGATGAGA 
AGATGCTCCA 
GTTGCTTCGT 
AATATGAAGC 
TGCTCCCGAA 
TCAATGGAAA 
ACTCGGTGTA 
ACGTGGATTC 
AGATGGATGC 
TCTGCTGCTG 
atgtgatttt 
gccatatcaa 
agtaattcaa 
ctgaaaataa 
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T19E7.2C (spliced) 

aatcgttctt cttcttattt tctacagctg atgatgtttg atgaaggttt tattttcctt acttttt^o* 

ccctgttaat attattttcg atattcccaa aaataattcc aaattttcag tccatattca ?2a««^!^f 

tgcaacatca tcactgattt tggtgatcag ttcaccatcg tccaacacc? caSccaa?c a?cScatac 

gatcggatca cgacaaaaca tcttctggao aatatatcac cgacatttag tgagtatgac ttalaaaata 

.catctgatca cttttcgagc cgttttgtcg- ctagggactt titaatgaa? clgltgllll tlToTallll 

tttagaqcaa aaacaataat ^:acact•^i-^•rt =*^=.^4.4.-.-,^4. ^. ^'J^^". ^P^cgaattt 




cctcttttat 
cacaataatc 
tctttgattt 
agttttatca 
cattagtatc 
agaaacgggt 
tgaaaaatat 
cttctttttt 
tgtatccctc 



cacataaaat 
gtttttttQt 
cccagtgaaa 
tctcttcatc 
ctagtttttc 
ctcgtgacga 
gatctttaca 
gtcattttca 
ctccatcccc 



ctcggtcgaa 
tctttgccgt 
tacctcaccc 
atacccagtt 
atttgagccc 
aagaatacgt 
gttatttcta 
atttttctca 
agact 



accttattaa 
gtctcatttc 
acttca.atcc 
tgataattta 
ggagctcaga 
gcaccacacg 
ttatatcctc 
aatttctcag 



agccacataa 
attttgatct 
ccacaaagtg 
ttatctgatc 
ctacatctcc 
acccccccat 
aaatctctcg 
atctattctt 



•tliaaagataa 
actctttcct 
agcaacccct 
cccatcccct 
gaatcatcat 
•cctgttcacc 
taatatcgta 
tttcttgtat 



Sgaagc 
ttaattccgc 
cccttcggat 
atcttgcaac 
tgtcgcctct 
acaaatagat 
cccatacacc 
tcaatttcct 
ttttggaact 



T19E7,2c' ( 
MYTDSNNRNF 
PPASPFGRGP 
APLTAEENAR 
EDVSREEGQL 
VTDSQTVEQW 
QQTQTSPGSV 
PSSG5RYQRS 
NESLSEYQRQ 



conceptual t 
DEVNHQHQQE 
STERPTTSSR 
VEDLSKGFYN 
NQLFDNKQQH 
LPTEWPNDV 
TVTATATQSL 
SSPRSSQSSI 
LIRKIRRRGK 



ranslatibn) 
QDFNGQSKYD 
YEYSSPSLED 
GFFESFNNNQ 
PVINNVSLSE 
FPTSNYAYIG 
FDPYHSQRHS 
KIARWPLAS 
NKVAARTCRQ 



YPQFNRPMGL 
IDLIDVLWRS 
YQQKHQQQQR 

GIVYNQANLT 
MQNDSLQAW 
FSDCTTDSSS 
GQRKRGRQSK 
RRTDRHDKMS 



RWRDDQRMME 
DIAGEKGTRQ 
EQIKTPTLEH 
EMQEMRDSCN 
SNGQIDYDHS 
TCSRLSSESP 

vDEQLASDNEL 
HYI* 



YFMSNGPVET 
VAPADQYECD 
PTQKAELEDD 
QVSISTIPTT 
YQSTGQTPLS 
RYTSESSTGT 
PVSAFQISEM 



VPVMPILTEH 
LQTLTEKSTV 
LFDEDLAQLF 
STAQPETLFN 
PLIIGSSGRQ 
HESRFYGKLA 
SLSELQQVLK 
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T19E7.2b (spliced) 




agccacataa 
attttgatct 
ccacaaagtg 
ttatctgatc 
ctacatctcc 
acccccccat 
aaatctctcg 
atctattctt 
ctcttgtaca 



ttaaagataa 
actctttcct 
agcaaoccct 
cccatcccct 
gaatcatcat 
cctgttcacc 
taatatcgta 
tttcttgtat 
ttttcatata 



ttaattccgc 
cccttcggat 
atcttgcaac 
tgtcgcctct 
acaaatagat 
cccataca'cc 
tcaatttcct 
ttttggaact 
tgtccatata 



cctcttttat 
cacaataatc 
tctttgattt 
agttttatca 
cattagtatc 
agaaacgggt 
tgaaaaatat 
cttctttttt 
tgtatccctc 
tcgtttgaat 



cacataaaat 
gtttttttct 
cccagtgaaa. 
tctcttcatc 
ctagtttttc 
ctcgtgacga 
gatctttaca 
gtcattttca 
ctccatcccc 
ctctcattta 



ctcggtcgaa 
tctttgccgt 
tacctcaccc 
atacccagtt 
atttgagccc 
aagaatacgt 
gttatttcta 
atttttctca 
agacttcccc 



accttattaa 
gtctcatttc 
acttcaatcc 
tgataattta 
ggagctcaga 
gcaccacacg 
tta'tatcctc 
aatttctcag 
tfccccagtta 



• 4 

T19E7,2b (conceptual translation) 

ofi;^^^^^^^ LLASSTTTNT TNTAPAAVNS FDEQEEESKK ILNMYLQMFN 
fSn^^oy"^"^^ SNYAYIGMQN pSLQAWSNG QIDYDHSYQS TGQ^PLSpS 
ATATQSLFDP YHSQRHSFSD CTTDSSSTCS RLSSESPRYT SESSTGTHES 
RSSQSSIKIA RWPLASGQR KRGRQSKDEQ LASDNELPVS AFQISEMsS 
KIRRRGKNKY AARTCRQRRT DRHDKMSHYI * AFQISEMSLS 



QQQVDQHGHH 
IGSSGRQQQT 
RFYGKLAPSS 
ELQQVLKNES 



HQHPYAYSGV 
QTSPGSVTVT 
GSRYQRSSSP 
LSEYQRQLIR 
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■T19E7 . 2a (spliced) 




ataaaatctc 
tttttcttct 
agtgaaatac 
cttcatcata 
gtttttcatt 
gtgacgaaag 
ctttacagtt 
attttcaatt 
catccccaga 
tcatttatgg 



ggtcgaaacc 
ttgccgtgtc 
ctcacccact 
cccagtttga 
tgagcccgga 
aatacgtgca 
atttctatta 
tttctcaaat 
cttccccttc 
aaataaattt 



ttattaaagc 
tcatttcatt 
tcaatcccca 
taatttatta 
gctcagacta 
ccacacgacc 
tatcctcaaa 
ttctcag'atc 
ccagttactc 
gaaaaaatc 



cacataatta 
ttgatctact 
caaagtgagc 
tctgatcccc 
catctccgaa 
cccccatcct 
tctctcgtaa 
tattcttttt 
ttgtacattt 



aagataatta 
ctttcctccc 
aacccctatc 
atccccttgt 
tcatcataca 
gttcaccccc 
tatcgtatca 
cttgtatttt 
tcatatatgt 



gaagccct 
attccgccac 
ttcggattct 
ttgcaacagt 
cgcctctcat 
aatagataga 
atacacctga 
atttcctctt. 
tggaacttgt 
ccatatatcg 



cttttatcac 
aataatcgtt 
ttgatttccc 
tttatcatct 
tagtatccta 
aacgggtctc 
aaaatatgat 
cttttttgtc 
atccctcctc 
tttgaatctc 



T19E7.2a { 
MGGSSRRQRS 
SSYDRITTKH 
YFMSNGPVET 
VAPADQYECD 
PTQKAELEDD 
QVSISTIPTT 
YQSTGQTPLS 
RYTSESSTGT 
PVSAFQISEM 



conceptual translation) 
TSATRRDDKR RRRQCFSSVA- 
LLDNISPTFK MYTDSNNRNF 
VPVMPILTEH PPASPFGRGP 
LQTLTEKSTV APLTAEENAR 
LFDEDLAQLF EDVSREEGQL 
STAQPETLFN .VTDSQTVEQW 
PLIIGSSGRQ QQTQTSPGSV 
HESRFYGKLA PSSGSRYQRS 
SLSELQQVLK NESLSEYQRQ 



DDEBETTSIY 
DEVNHQHQQE 
STERPTTSSR 
YEDLSKGFYN 
NQLFDNKQQH 
LPTEWPNDV 
TVTATATQSL 
SSPRSSQSSI 
LIRKIRRRGK 



GVSSIFIWIL 
QDFNGQSKYD 
YEYSSPSLED 
GFFESFNNNQ 
PVINNVSLSE 
FPTSNYAYIG 
FDPYHSQRHS 
KIARWPLAS 
NKVAARTCRQ 



ATSSLILVIS 
YPQFNRPMGL 
IDLIDVLWRS 
YQQKHQQQQR 

GIVYNQANLT 

MQNDSLQAW. 

FSDCTTDSSS 

GQRKRGRQSK 

RRTDRHDKMS 



SPSSNTSIQS 
RWRDDQRMME 
DIAGEKGTRQ 

EQIKTPTLEH 
EMQEMRDSCN 
SNGQIDYDHS 
TCSRLSSESP * 

DEQLASDNEL 
HYI* 
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Human Glycogen synthase klnase-3 beta (6SK-3 beta) . 

1 msgrprttsf aesckpvqqp safgsmkvsr dkdgskvttv vatpgqgpdr pqevsytdtk 

. 61 vigngsfgw yqaklcdsge Ivaikkvlqd krfknrelqi mrkldhcniv rlryffyssg 

121 ekkdevylnl vldyvpetvy rvarhysrak qtlpviyvkl ymyqlfrsla yihsfgichr 

181 dikpqnllld pdtavlklcd f gsakqlvrg epnvsyicsr yyrapelifg- atdytssidv 

241 wsagcvlael llgqpifpgd sgvdqlveii kvlgtptreq iremnpnyte fkfpqikahp 

301 wtkvfrprtp peaialcsrl leytptarlt pleacahsff delrdpnvkl pngrdtpalf 
361 nfttqelssn pplatilipp. hariqaaast ptnataasda ntgdrgqtnn aasasasnst 
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Human Glycogen synthase kinase-3 alpha <6SK-3 alpha) . 

1 msgggpsggg pggsgrarts afaepggggg gggggpggsa sgpggtgggk asvgamgggv' 

-.t^ QffSfgsrSPg gsggggsggp gagtsfpppg vklgrdsgkv ttwatlgqg persqevayt 

ToJ **'-'^iS^f^ gvyyqarlae trelvaikkv Iqdkrfknre Iqiiarkldhc nivrlryffy 

^^'^^^ Inlvleyvpe tvyfvarhft kakltipily vkvymyqlfr slayihsqgv 

241 chrdikpqnl Ivdpdtavlk Icdfgsakql vrgepnvsyi csr^y^apel ifgatdy?^ 

IV; it^lt?^ aelllgqpif pgdsgvdqlv eiikvlgtpt reqiremnpn ytefkfpqik 

«*'P"t'«^f'^f rtppeaxalc sslleytpss rlspleacah sffdelrclg tqlpnnrplp 

• til siqpslnail ipphlrspag tttltpssqa Itetptssdw qstdatptlt 
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Mouse Glycogen s^^nthase klnase-S be1:a. 

1 msgrprttsf aesckpvqqp safgsmkvsr dkdgskvttv vatpgqgpdr pqevsytdtk 

61 vigngsfgw yqaklcdsge Ivaikkvlqd krfknrelqi mrkldhcniv rlryffyssg 

121 ekkdevylnl vldyvpetvy rvarhysrak qtlpviyvkl ymyqlfrsla yihsfgichr 

181 dikpqnllld pdtavlklcd fgsakqlvrg epnvsyicsr yyrapelifg atdytssidv 

241 wsagcvlael llgqpifpgd sgvdqlveii. kvlgtptreq iremnpnyte fkfpqikahp 

301 wtkvfrprtp peaialcsrl leytptarlt pleacahsff delrdpnvkl pngrdtpalf " 

361 nfttqelssn pplatilipp hariqaaasp panataasdt nagdrgqtnn aasasasnst 
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Mouse Glycogen synthase klnase-3 alpha {6SK-3 alpha) . 

m « 

1 masttamdvl eelssdssek qrsvnildsf vkdmferias easflarqar nstinsreiq 

1 f-ll Icrrgtgcgk asvwamgggv gasssgvggg sggpgstsf 1 qpgvklghds 

121 rkvttyvaty gqdpersqev actdikvign gsfgwyqew ladtrelvai kkvlqdkrfk 

181 yrelqamckl dhcnxvrlqy ffyssgekkd dlylnlvley vpetvyxvar hftkakliip 

241 iiyvkvymyq Ifrslayihs qgvchrdlnl Ivdpdtailk Icdfgsakql vlgttvapel 

Isi ^fvih^v 9^3^*!"^' qpifpgdngv dqlveiikvl gtptreqire mnpkytefkf 

361 pqikahpwtk vf ksrtaprp .Ihsalacwst hhtqgsphlr Ivptaslmnc gvsgpapqrp 
421 ptspcstsvl vicpsnhlsm pfssllt • 
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1 innkqllscsl ksgkqvtinvy asvatdgvdq qveisyydqk vigngsfgw flaklsttne 
61 mvaikkvlqd krf knrelqi nirklnhpniv klkyf f yssg ekkdelylnl ileyvpetvv 
12X rvarhyskqr qqipmiyvkl ymyqUrsla yihsigichr dikpqnllid pesgvlklcd 
181 fgsakylvrn epnvsyicsr yyrapelifg atnytnsidv wsagtvmael llgqpifpgd 
241 sgvdqlveil kvlgtptreq iqsinnpnyke fkfpqikahp wnkvfrvhtp aeaidliski 
301 xeytptsrpt pqaacqhaff delrnpdarl psgrplptle mdgpmgtgei sptsgdvagp 

•3 0X SSI ' 
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sgg-1 (GSKt3) inhibits constitutive SKN-1 nuclear 
accumulation and induction of its target gene gcs-L 




B. 



Control [^/o-7(RNAi)] 




Control [:g/o-i(RNAi)] 




-.■J. 

V ■ . • . A '•■ 
J*. ■ .' 



^^g-i(RNAi) 



5gg-i(RNAi) 
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Ala substitution at a predicted GSK-S phospliorylation 
site results in nuclear localization of SKN-1 




A. 



T68 T87 
T92 




B. 



GFP 




mmK 
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SKN-1 



393 

TTDSSSTCSRLSI^ES 



p-Catenin SGIHSGATTTAPSLS 




P 



P 



\ /. 



GSK-3 



P 

\ 

CKl 



Peptides: i- wr 



2. WT+P397 

3. S393A-I-P397 



393 397 

DCTTDSSSTCSRLSSESPRYTSE 

DCTTDSSSTCSRLSSESPRYTSE * 
DCTTDSSSTCARLSSESPRYTSE 



Assay: 



350000 



200000 



ISOOOO 



100000 



SOOOO 



WT+P397 
S393A.-fP397 
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